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REMARKS 

Reconsideration and entry of the present amendment after final is respectfully 
requested. 

Status of the Claims : 

Claims 2-8, 12, 14, and 18-38 were pending after submission of 13 June 
2003. Claims 9-11 have been cancelled in the current amendment, they were 
previously withdrawn from examination. Applicants reserve the right to pursue the 
contents of these claims in continuing applications. Claims 2-8, 12, 14. and 18-38 
remain under examination after amendment Claims 12, 14, and 20 have been 
amended regarding the encoded polypeptide, the phrase "involved in DNA double 
strand break repair" has been replaced with the phrase "binds to a MRE11 
polypeptide". Support for the amendments is found in the claims as originally filed, 
and throughout the specification, particularly page 1 , lines 30-32. No new matter 
has been added. 

Rejections under 35 U.S.C. §112, 1 st Paragraph - Enablement : 

Claims 2-8, 12, 14, and 18-38 are rejected under 35 U.S.C, §112, first 
paragraph as failing to comply with the enablement requirement. The rejection is 
repeated for the reasons of record as set forth in the Office Action of 1/13/03. 

The Action asserts, briefly, that given the complexity of DNA repair, including 
the involvement of other proteins, it is unclear what agronomic benefit a transgenic 
plant with modulated RadSO levels would have, and unclear how modulating the 
level of RadSO would affect DNA repair or polynucleotide integration. Therefore the 
Action concludes that one skilled in the art would not know how to use SEQ ID NO: 
1, and sequences having 90% and 95% to the disclosed sequences, without undue 
experimentation. 

Applicants respectfully disagree, for the reasons of record {e.g., see response 
filed 6/13/03), some of which may be reiterated for clarity. Applicants note that 
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claims 12, 14, and 20 have been amended, as described above, to recite 

polypeptides that "bind to a MRE1 1 polypeptide". 

Applicants have provided full-length sequences, demonstrated conserved 

functional motifs (e.g., page 2, lines 17-24, and Example 4), extensive overall 

homology, direction regarding nucleotide and amino acid substitutions, and percent 

identity comparisons (e.g., Appendix A and D; page 5, line 20 — page 6; and line 25; 

page 16, line 1 - page 20, line 20 of the specification), Rad50 function in DNA break 

repair and interaction with Mre11 (e.g., page 1, line 15 - page 2, line 14), vector 

construction, plant transformation, and methods to modulate the level of Rad50 

(e.g., page 37, line 5 - page 42, line 22; page 44, line 28 - page 47, line 25; page 47, 

line 28 - page 49, line 26; page 50, line 29 - page 52, line 31 ; and page 53, line 1 - 

page 54, line 6). The Examiner is reminded that the need for routine 

experimentation and screening is not considered undue (see MPEP 2164.01): 

The test of enablement is not whether any experimentation is 
necessary, but whether, if experimentation is necessary, it is undue. In 
re Angstadt, 190 USPQ 214 (CCPA 1976). An extended period of 
experimentation may not be undue if the skilled artisan is given 
sufficient direction or guidance. In re Cofianni, 195 USPQ 150 (CCPA 
1977) (Miller, J M concurring). The experimentation required, in addition 
to not being undue, must not require ingenuity beyond that expected of 
one of ordinary skill in the art. In re Angstadt supra. For example, in 
one instance a "few hours" of experimentation to determine process 
parameters was not considered to be undue in view of the nature of 
the invention (preparation of oxygenated hydrocarbons). In re 
Borkowski, 164 USPQ 642 (CCPA 1970). In Tabuchi v. Nubel, 194 
USPQ 521 (CCPA 1977) a screening procedure which took 15 
calendar days was not considered undue experimentation because the 
test was both simple and straightforward and because of its 
demonstrated success in producing the desired result. 

Further, the Examiner has not given specific reasons, or provided specific 
citations to support the conclusions that modulation of the level of Rad50 would not 
have an effect on the plant 

Modulation of Rad50, thereby modulating the level of DNA repair In the plant, 
is predicted to increase the efficiency of incorporation of heterologous nucleic acids 
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in the genome of a plant For the record, Applicants disagree with the Examiner's 
conclusion that modulation of RadSO must result in an agronomic benefit to the 
transgenic plant. Applicants utility, to use RadSO to modulate the efficiency of 
incorporation of a transgene into the plant genome (page 2, lines 10-14) does not 
require that RadSO confer any particular agronomic phenotype to the plant. 

Applicants note that they have provided another component of the DNA repair 
complex. Mre11, in U.S. Patent 6,646,182, issued Nov. 11, 2003, thereby 
demonstrating the presence of Mre1 1 in plants. Also, Gallego et al. have 
demonstrated Rad50 involvement in DNA repair and meiosis in Arabidopsis (Plant J. 
25:31-41 2001, submitted in Appendix C). Applicants have modified the Multiple 
Sequence Alignment (Appendix A) to include the conserved motifs shown in 
Example 4 of the specification, currently presented as Appendix D. Applicants note 
that the alignment is similar to the one shown by Gallego et al. (supra), which shows 
higher conservation of the N- and C-terminal regions of RadSO. Appendix D also 
includes the results of an analysis performed using the Lion BioScout software 
package, and Hmmerpfam search using GCG software, both of which detect the 
known RadSO Zn-hook Pfam domain (e = 6.9e-07). 

Applicants assert this support fully enables one of skill in the art to make and 
use the full-breadth of the invention without undue experimentation. Absent of any 
evidence from the Examiner supporting the conclusion that modulation of RadSO 
would not effect the plant, Applicants believe this rejection cannot be supported. 
Therefore Applicants respectfully request reconsideration and that the rejection of 
claims 2-8, 12, 14, and 18-38 under 35 U.S.C. §112, first paragraph for lack of 
enablement be withdrawn. 

Rejections under 35 U.S.C. §112. 1 gt Paragraph. Written Description : 

Claims 2-8, 12, 14, 20 and 23-38 are rejected under 35 U.S.C. §1 12, first 
paragraph, as containing subject matter not sufficiently described in the specification 
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to indicate the inventor(s) had possession of the invention. The rejection is repeated 
for the reasons of record as set forth in the Office Action of 1/13/03. 

Briefly, the Action asserts that the Applicant has not described a 
representative sample of the genus, and that the conserved domains are not unique 
to Rad50 proteins, but are common to all DNA repair proteins. The Action also 
asserts that Example 14 of the Revised Interim Written Description Guidelines is not 
applicable because claims drawn to polynucleotides having 95% sequence identity 
are not rejected in this Action. 

Applicants respectfully disagree, for the reasons of record (e.g., see response 
filed 6/13/03), some of which may be reiterated for clarity. As is stated in the MPEP 
2163 (see p. 2100-168) the written description for a claimed genus may be satisfied 
by "disclosure of relevant, identifying characteristics, i.e., structure or other physical 
and/or chemical properties, by functional characteristics coupled with known or 
disclosed correlation between function and structure, or by a combination of such 
identifying characteristics, sufficient to show that the applicant was in possession of 
the claimed genus." Applicants respectfully submit that the present application 
meets this standard by the disclosure of the structures of the full-length Rad50 
polynucleotide (SEQ ID NO: 1), which encodes a full-length Rad50 polypeptide 
(SEQ ID NO: 2) which have the structural/chemical properties of significant 
sequence identity to known Rad50 polynucleotides and polypeptides, conserved 
domains (see, for example, Example 4 and the Multiple Sequence Alignment), and 
the functional characteristic of Rad50 polypeptide binding to MRE11 polypeptides. 

Regarding the applicability of Example 14 r for the record Applicant notes that 
while claims 18 and 21, directed to sequences having 95% identity to either SEQ ID 
NO: 1 or SEQ ID NO: 2, are not included in this rejection, they are also not allowed. 
Also, the Examiner has not made clear why Example 14 does not apply, other than 
focusing on the specific percent identity recited and not the substantive features of 
the structural and functional elements. Applicants maintain that the current claims, 
including those directed to 90% sequence identity, meet the Written Description 
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standard as set forth in the 35 U.S.C. §112, described MPEP, and shown in 
Example 14 of the Revised Interim Written Description Guidelines. 

Applicants have coupled structural, chemical, and functional properties to 
describe polynucleotides having 90% and 95% sequence identity to SEQ ID NO: 1, 
or encoding polypeptides that have 90% and 95% sequence identity to SEQ ID NO: 
2 such that a person skilled in the art can envisage the claimed invention, thereby 
meeting the 35 U.S.C. §112, first paragraph written description requirement. 
Therefore Applicants respectfully request reconsideration and that the rejection of 
claims 2-8, 12, 14, 20 and 23-38 under 35 U.S.C. §112, first paragraph for lack of 
written description be withdrawn. 



In light of the foregoing remarks and amendments, it is believed that claims 2- 
8, 12, 14 and 18-38 are in condition for allowance. Withdrawal of the outstanding 
rejections and allowance of all of the remaining claims is respectfully requested. 
The Examiner is invited to telephone the Applicants representative to expedite 
prosecution and allowance. 



PIONEER HI-BRED INTERNATIONAL, INC. 

Corporate Intellectual Property 

7100 N.W. 62* Avenue 

P.O. Box 1000 

Johnston, Iowa 50131-1000 

Phone: (515)270-4192 

Facsimile: (515)334-6883 



CONCLUSION 



Respectfully submitted, 




Virginia Dress 
Agent for Applicant(s) 
Registration No. 48,243 
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Summary 

The Rad50 protein is involved in the cellular response to ONA-double strand breaks (DSBs). including the 
detection of damage, activation of cell-cycle checkpoints, and DSB repair via recombination, ft is 
essential for meiosis in yeast, is involved in telomere maintenance, and is essential for cellular viability 
in mice' Here we present tho isolation, sequence and characterization of the Arabidopsis thaliana HAD50 
homologue [AtRADSO) and an Arabidopsis mutant of this gene, A single copy of this gene is present in 
the Arabidopsis genome, located on chromosome II, Northern analysis shows a single 4.3 Kb mRNA 
species in all plant tissues tested, which is strongly enriched in flowers and other tissues with many 
dividing cells. The predicted protein presents strong conservation with the other known Rad&O 
homologues of the amino- and carboxy-terminal regions. Mutant plants present a sterility phenotype 
which co-segregates with the T-DNA insertion. Molecular analysis of the mutant plants shows that the 
sterility phenotype is present only In the plants homozygous for the T-DMA insertion. An in vitro mutant 
cell line, derived from the mutant plant, shows a clear hypersensitivity to the DNA-dam aging agent 
methyl methane sulphonate. suggesting a role of RAD50 in double-strand break repair in plant cells. This 
is the first report of a plant mutated in a protein of the Rad50-Wlre11-Xrs2 complex, as well as the first 
data suggesting the involvement of the Rad50 homologue protein in meiosis and DIMA repair in plants. 

Keywords: Arabidopsis, RADSO, DSB repair, sterility, recombination. 



Introduction 

The repair of DNA double-strand breaks IDSBs) involves 
genetic recombination. Fundamentally, two different 
forms of recombination are involved: thoso involving 
DNA sequence homology between the participating DNA 
molecules, and those that appear to act independently of 
such homology, called homologous recombination (HR) 
and non-homologous end-joining (NHEJ), respectively. 
Considerable advances in the understanding of these 
mechanisms have been made In recent years, both in 
terms of the proteins involved and of their relative 
contribution to repair of DSBs in different organisms 
and/or cell types. These issues are the subject of a number 
of recent reviews (Fox and Smith, 1998; Jeggo, 1998; 
Paques and Haber, 1999; Petrini etaL, 1997; Roeder, 1997; 
Smith, 1998; Smith and Nicolas, 1998; Tsukamoto and 
Ikeda, 1998). 

©2001 Blackwell Science Ltd 



Much of our understanding of the proteins implicated in 
the process** of recombination and DSB repair comes 
from studies of the yeast Saocharomyces earavisiaa* 
Homologous recombination in yeast cells is largely 
under the control of the genes in the RADS2 eplstasis 
group, which include RADS0-SS t XRSZ and MRE11 
(Ajimura at ah, 1993; Gome, 1993; Hayn&a and Kurtz, 
1981). These genes were mostly identified as being needed 
for the repair of ionizing radiation-induced DNA damage, 
and are also needed for meiosis. Deletion of these genes in 
yeast generates defects in both recombination and DNA 
repair with differing phenotypes in terms of recombination 
and the repair of DNA double-strand breaks. Mutants of 
radSI, radS2 r radS4, radSS, rad57 and radSS show defects 
in HR, while radSO and mm11 mutants have shown defects 
in NHEJ and are hyper-rec for HR, 
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Double-strand breaks are repaired by both HR and NHEJ 
in all cell types examined; however, the preferred mech- 
anism of recombinattonal DNA repair differs significantly 
in a given organism. In particular, it is clear that non- 
homologous recombination is much more frequent than 
homologous recombination in mammalian and ptant cells, 
whereas yeast cells rely almost; entirely on homology- 
based recombi national DNA repair. The basis of the strong 
preference for illegitimate recombination in mammalian 
and plant cells is not fully understood- Both pathways have 
a common initial substrate (DSB), the processing of which 
may be channelled into either HR or NHEJ. This channel- 
ling is apparently determined by the initial metabolism of 
the DSB itself, and it has recently been suggested that it 
may result from a competition for binding the DNA ends 
between the Ku and Rad52 proteins (VanDyck etal., 1999}. 
In this context, the Rad50/Mre11/Xrs2 complex is of 
particular interest, as mutants show a weak hyper-HR 
and strong hypo-NHEJ phenotype in mitotic yeast cells 
(Boulton and Jackson, 1998; Moore and Haber, 1996; 
Schiestl etaL, 1994; Tsukamoto 9tai r 1996; Tsukamoto 
era/., 1997). However, in some assays RADSO ts needed for 
intrachromosomal recombination (Elias-Arnanz etal, 
1996; Rattray and Symington, 199S; Tran etal., 1995). 
RadSCT mutants show altered processing of DNA ends 
during recombination (Ivanov era/., 1994; Sugawara and 
Haber, 1992). This complex has also been implicated in the 
regulation of the response of yeast cells to DNA damage 
[Kiromani and Muniyappe, 1997; Lee ef aL, 1998). 

Homologues of many of these yeast genes have been 
cloned from different organisms (see reviews by Kanaar 
and Hoeijmakers, 1997; Paques and Haber, 1999), includ- 
ing RADSO genes from $. cerevisiae (Alani etaL 1989); S. 
pombe (Saunders, 1999); man (Dolganov etaL. 1996): 
mouse (Kim era/., 1996); and Caenorhabditis efegans 
(Often berg and Heyting, 1996). RadSO shows homology 
to the Escherichia coVt SbcC protein (Sharpies and Leach, 

1995) , and has been shown to form a complex with the 
Mre1l r Xrs2 and Lig4 proteins. The RadSO/Mrell complex 
is involved in DSB repair via NHEJ in mammalian cells, 
and the null mouse radSO mutant Is in viable both in 
cultured ES cells and in developing mouse embryos (Luo 
etal., 1999). This cclWcthal phenotype has also been seen 
in mouse mreH mutants (Xiao and Weaver, 1997), as well 
as chicken (Sonoda etaL 1998) and mouse radSI mutants 
(Lim and Hasty, 1996; Sharan etal, 1997; Tsuzuki etaL, 

1996) . Interestingly, rad$2 mutant mice are viable and 
show increased radioresistance (Rijkers etaL 1998; 
Yamaguchi-lwai eta!., 1998). 

A number of radiation-sensitive plant mutants have 
been isolated in recent years (reviewed by Britt, 1999; 
Gorbunova and Levy, 1999; Men 9 isle and Paszkowski, 
1999). The mutated genes of methyl methane sulphonate 
(MM 5) and UV-hypersensitive Arabidopsfs mutants have 



recently been identified: a ribosomat SZ7 protein 
(Revenkova eta/., 1999); the Arabidopsis Rad1 homologue 
protein {6a I lego etal, 2000; Liu etaL, 2000); and a member 
of the structural maintenance of chromosomes protein 
family (SMC; Mengiste etaL, 1999; Strunnikov, 1998; 
Strunnikov etaL, 1993). RadBO is also a member of this 
family. Plant homologues of RAD51 (DouLriaux etaL, 1998; 
Smith etal t 1996) and MRE11 (Hartung and Puchta, 1999) 
have been isolated, and the rapid progress of the 
Arabidopsis sequencing project makes it likely that many 
other genes implicated in these processes will be isolated 
in the near future. 

As part of our investigation of the control of the early 
events of DSB repair and recombination in plants, we have 
isolated the Arabidopsis thaiiana homologue of the RADSO 
gene, and here we describe the isolation, sequencing and 
preliminary characterization of this gene. Furthermore, we 
have identified a radSO mutant plane that presents a 
sterility phenotype in agreement with the role of this 
protein in meiosis in yeast cells. We also show that 
homozygous mutant cell lines present hypersensitivity to 
MMS, suggesting (as has been shown in yeast cells) a role 
for RADSO in DSB repair in plants. This is the first evidence 
of the implication of the RADSO gene in meiosis as well as 
in D$B repair in plants- 
Results and Discussion 

77jc Arabidopsis thaliana RADBO homologue 

A 700 bp A. thaliana cDNA clone with homology to the 
carboxy- terminal region of the yeast RadSO gene was 
detected in a screen of the Arabidopsis cDNA expression 
library with an antibody against a peptide sequence from 
the cytoplasmic domain of vertebrate beta 1 integrin. 
Based on this sequence (a kind gift from P. Nag pal and R- 
Quatrano), oligonucleotides were designed and a frag- 
ment of 1.5 kb of genomic sequence was amplified from 
genomic DNA prepared from an Arabidopsis cell-suspen- 
sion culture. This DNA fragment was used to screen a 
genomic lambda bank prepared with DNA from the same 
cells, permitting the identification of two lambda clones 
with overlapping inserts spanning approximately 22 kb of 
genomic DNA and including the Arabidopsis RADSO 
homologue locus. Sequencing of 10 kb from the inserts 
of these clones showed that they spanned the entire 
AtRADSO gene (data not shown). Our genomic DNA 
sequence has recently been confirmed by (and is identical 
to that of) the Arabidopsis genome sequencing project (Lin 
era/., 1999). 

The cDNA encoding the Arabidopsis RADSO homologue 
was isolated by RACE-PCR with the Marathon Kit 
(Clontech Inc., Palo Alto, CA, USA), Poly(A)+ RNA was 
prepared from total RNA isolated from the A thatiana cell 

© Btackwell Science Ltd, The Plant Journal, (2001), 25, 31-41 
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suspension; double-stranded. cDNA was synthesized and 
adapters wore Hgated to the two ends. This library of 
adapter-ligated cDNA was PCR amplified using a RADSO 
specific primer and primers against the adapter 
sequences. A PCR fragment of 4.3 kb was isolated on 
agarose gel and cloned into pG£M-Teasy (Promega Inc., 
Madison, Wl, USA). Sequencing of the 4305 bp insert 
clone confirmed that it includes the entire AtRADSO cDNA, 
and that this cDNA corresponds to the genomic clones 
(above). The gene covers 8486 bp and contains 27 exons; 
all 5' donor sites contain the AG/GT conserved function; 
and the conserved AG is present in all the 3? acceptor sites. 
We have submitted this mRNA sequence to GenBank 
(Gallego etai, 1539). The computer prediction of the 
mRNA sequence of this gene from the genome sequen- 
cing project (Lin era/., 1999) misplaces a number of the 
intron-exon borders, and thus predicts a protein with 53 
inserted and 32 deleted amino acids relative to that 
predicted from our cDNA sequence. 

Amino acid comparison of Rad50 homologucs. 

The cloned cDNA from Arabidopsis presents a putative 
methionine initiation codon at nucleotide residua 146 in 
the first exon. No other open reading frames were 
detected. The termination codon of the open reading 
frame is located at nucleotides in position 4097, giving a 
predicted protein of 1316 amino acids. A 3'-untranslated 
region covers 181 nt before the short poIy(A) tail present in 
the cDNA clone. The length of the protein is strongly 
conserved in all organisms in which it has been studied: 
the human, mouse and yeast RadSO proteins have 1312 
amino acids, while the C. slogans Rad50 has 1298 and the 
Arabidopsis RADSO cDNA reported here predicts a protein 
of 1316 amino acids. Amino acid sequence comparison 
with the known Rad50 proteins shows a high conservation 
at the N- and C-terminal regions. In particular, the amino* 
terminal 190 amino acid region of the Arabidopsis protein 
has 56 and 52% sequence Identity with the yeast and 
human RadSO proteins, respectively. The carboxy-terminal 
207 amino acids show >62 and 52% identity with the 
corresponding region present in the yeast and human 
RadSO proteins, respectively (Figure 1a). The overall amino 
acid identity of the predicted Arabidopsis protein is of 
29.8% with the human and 27.3% with tho yeast proteins, 
respectively. The Arabidopsis RadSO protein predicted 
from the cDNA sequence contains 18.2% of acidic amino 
acids and 16.9% of basic residues. The protein is predom- 
inantly hydrophilic, with 30.3% of hydrophobic residues. It 
presents three glycosylation motifs in positions 396-399, 
419-422 and $20-623. A type-A ATP-binding side is present 
in the amino-terminal region at amino acid positions 34- 
41. A Walker 8 motif fs present at the carboxy-terminal 
region between amino acids 1235 and 1242. Checking of 

© Blackwell Science Ltd, The Plant Journal, (20O1), 25. 31-41 



Arabidopsis RadSO 33 

the predicted AtRadSO amino acid sequence for plant 
localization signals with the psort program (http://psort- 
nibb.ac.jp/), yields a putative nuclear localization signal 
with certainty of 0.350 (success). No chloroptast or 
mitochondrial localisation sequences were identified in 
the sequence by PSORT. Figure Kb) shows the output from 
the CO) 1,5 program (http://ulrftc3.unil.ch/software/ 
coils_FORM.html; Lupas etal, 1991). It can be seen that 
the predicted AtRad50 protein conserves the central 
coil ed-coi led conformation domain seen in tho yeast and 
human proteins (Dolganov etaL 1996). 

Chromosomal localization of Arabidopsis RAD50 locus 

Southern analysis of DNA prepared from Arabidopsis 
suspension cells show a unique band after digestion with 
six different restriction enzymes (Figure 2a), suggesting 
that the Arabidopsis RADSO gene is present as a single- 
copy gene within the A. thatiana genome. 

Using PCR amplification of DNA prepared from the 
Arabidopsis YAC bank (Creusot etai. 1995), we have 
mapped the RADSO gene on chromosome It (YAC C1C 
11E1L) near the marker TEnS (data not shown). Recently, 
genomic sequencing results of Arabidopsis chromosome 
11 BAQ F22D22 has confirmed this mapping (Lin etal., 
1999). 

Expression of the RADSO gene in Arabidopsis 

Northern analysis of RNAs prepared from Arabidopsis 
suspension cells as well as different plant tissues shows a 
single mRNA species, tho length of which corresponds to 
that of the cDNA (Figure 2b). This has been confirmed by 
Northern analysis of poly<A)+ RNA from suspension cells 
and flower buds, as well as by RT-PCR analysis (data not 
shownh 

Studies of human and mouse RADSO gene expression 
have shown the presence of alternative spliced transcripts 
(Kim etal., 1996; Kim era/,, 1999). We were unable to detect 
alternative transcripts in A thaliana using both Northern 
and PCR analysis. The Arabidopsis RADSO mRNA is 
expressed in all cell tissues analysed- however, stronger 
levels were found in fast growing cells such as cell- 
suspension culture, young primary roots and flowering 
structures. Tissue from flower buds and mature flowers 
contains a relatively high proportion of cells undergoing 
division, processes that appear to bo correlated with a high 
level of expression of RADSO, Maximum expression of 
human and mouse RADSO genes has been detected in the 
testis (Dolganov etal., 1996; Kim era/., 1996). Thus, 
although the transcript is enriched in meiotic tissues, 
expression of the AtRADSO gene is not specific to meiotic 
cells. This pattern of expression is reminiscent of that 
previously reported for the Arabidopsis RADS1 homologue 
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Figure 1. CornparfcwiA □( ArHadSO and other RaclSO proteins. 

fa) Alignment of RadSO homology c? from Awbidopsb (At), mon (H^) and 5. ccrevisixe (5c). identical residues between two or more sequences ere printed 
as white, uppercase letters outlined in black; conservative substitutions as block, lowercase \ otter} outlined in grey; non-conserved residues* ere in lower 
case on white. Numbers to the right of each line indicate the position in the amino add ftequencfr of the last residue on that tine. The soauences were 
aligned with tha clustalx program using the blosum weighting matrix (Thompson era/., 1994), 

(bl Comparison of pm#cted coilcrl-cnft .itrurlurft of ihe Arabidopxts (131 Baa) r human 03 12 Aft) end S. cer&v'tsioo (1312aa) RodSO proteins. Output of the 
COILS program <http;//ulrsc3*unit.ch/software/coils.FORM,hlmt) with window width - 2B; (.upas eta/.. 1991). 

© Blackwcll Science Ltd, Tho Plant Journal, (2001), 25, 31-41 
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(Doutriaux eta!., 1998) arid those of the UVHI/AtftADI 
(Gallego etaL, 2000; Liu etaL, 2000) and UVR2 (Ahmad 
et ai, 1997) genes Of Arabtdopsis, 

As reported for the human RadSO protein following, 
treatment with ionizing radiation {Dolganov etaL, 1996}, 
the steady-state level of AtRAOSO mRNA does not change 
in cultured cells In response to exposure to the radio- 
mimetic compound MMS (data not shown). This contrasts 
with the situation reported for mouse rW>50gene expres- 
sion, which does increase after treatment of NIH-3T3 cells 
with MMS (Kim et&L r 1996). 



identification and characterization of a rad50 mutant of 
Arabtdopsis 

Based on our AtRADSO sequence, we designed oligonu- 
cleotides and screened the Versailles Arabtdopsis T-DNA 
insertion mutant collection (Bechtold ef at, 1993; Bouchez 
ctat. t 1993) using PCR as described by Gaelen etaL (2000). 
From this screen we identified a single plant containing a 
T-DNA insertion in the RAD50 locus. The mutant plant is 
fully sterile, producing numerous flowers and small, 
empty siliques (Figure 3). 

The mutant phenotype and the kanamycin resistance 
marker of the inserted T-DNA co-segregate, with seHed 
haterozygotes producing progeny with the 3 : 1 ratio 
expected for a single-locus insertion (128 non-mutant : 38 
mutant; y} r 1 df = 0.39). Southern analysis has confirmed 
that the T-DNA insertion is present at a single locus (not 
shown), and PCR results indicate the single-copy nature of 
the insertion (see the size of the amplified fragment 
present in the homozygous plants, Figure 4). Molecular 
analysis shows that the 38 plants presenting the sterility 
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phenotype are, as expected, homozygous for the T-DNA 
insertion. Thus there Is clear linkage between the T-DNA 
insertion and the sterility phenotype of the mutant tad5Q 
allele, which is recessive. Furthermore, we have also 
observed a sterility phenotype in plants expressing 
A(RAD50 antisense RNA (not shown). These results sug- 
gest a role for the RadSO protein in meiosis in plants which 
is consistent with that known in yeast. Due to the celMethal 
phenotype of the rad50 null mutant in mouse, this 
constitutes the first implication of RADSO in meiosis in a 
metazoan organism. 

Mutant plants which have been germinated on agar 
medium before being transferred to soil are reduced in size 
in comparison to their non-mutant siblings; however this 
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Figure 2. South am and North cm analysis of AtRADSb 
(ol Southern analysis ot ON A fram Arsbid6p*ia cell eutturo probed with 
the AtflADSO genomic proba. From l&ft to rioht: feoftU Bytfh, 
fcoRV-, XM' and Apd -digested ON A. SI«s of the detected fragments 
an* shavun to the Left. 

(b> Northern analysis nf ratal Arsbldopsls RNA from different tissues, 
probed with tha ? half of tha AtRADSO cDNA (upper) and the t85 rONA 
proba as a loading control (lower). Lanes: 1, suspension eel It; 2, young 
rosette lumrea grown for 3 weeks; 3, laavos from stems ot moiuru 
flowering plants; 4, rosette leave? fram npn-llowering plants grown for 
6 weeks with short daylength (8 h licht period); 5. mature flowem; 6, 
flower buds; 7, roots: B. sterna. 
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Figure & MM 5 sensitivity 0/ retf So mutant celts. 
RsureS. The radSOnurtom is sterile. Mutant t/«fSafrt>rfSP upper on each plate) and heterozygote tfl/UWa/ 

holographs of 7-wsek-old mutant (radSGtotfCT »n* hctcrwgom r ^ l^r on tach plate) susparision wIMincs nrown In Petri d«h» o-t 
irodSOmAOSOt plants- The mutnM plants produce many flowara and ^ mcdium corrtD i n i ng dillcrcht corteemrstidAa erf MMS: <a) no MMS: 
small, wnpty siltques. , b) (^0033^ (c) 0.0067%; |d> 0,01%; (e> 0.0133%; (ft 0.01&7% MMS. 



phenotype is not seen if seeds arc germinated directly in 
soil in the greenhouse. This growth difference is Seen 
whether or not antibiotic (kanamycin) is included in the 
agar medium, and is thus not an artefact caused by the 
antibiotic. Further studies will be needed to determine the 
exact origin of this facultative growth defect. 

The mutated radSO gene and its inserted T-DNA were 
PCR-amplified, cloned, and the plant DNA/T-DN A junctions 
sequenced to determine the exact position and structure of 
the insertion (Figure 5). The T-DNA is inserted into the 
coding sequence of the gene and causes disruption of the 
RADSO ORF in the 21st exon, after the 1050th codon (of 
1316). Thus any mutant protein produced would lack the 
266 C-terminal amino adds of the wflri-typo protein. 

The inserted T-DNA has a small, 141 nucleoids, inverted 
duplication of the LB sequence upstream of the RB, and 
has lost the first 10 nucleotides foil owing the RB nick site. 
The inverted orientation of this LB DNA implies the 
participation of a replicated T-ONA moleculB in the origin 
of this sequence, and that It id not simply the result of 



ligation-derived concatamers of the single-stranded trans- 
forming T-DNA molecules. The beginning of the inser- 
tion (inverted LB sequence} has a 12 bp (with three 
mismatches) homology to the RADSO sequence at that 
position. The IB junction is much simpler, with the T- 
DNA sequence ending 16 nt downstream of the LB nick 
site, followed by the insertion of two As prior to 
resumption of the RAD50 gene sequence. The Insertion 
caused an 13 bp deletion (not including the 12 bp 
homology, above) of RADSO sequence. Neither of the 
two junctions occurs exactly at the nick site of the 
corresponding border sequence. We have previously 
seen similar structures at T-DNA insertion sites (unpub- 
lished results), and the mechanism of T-DNA integration 
has been reviewed by Tinland (1996). The systematic 
sequencing of the DNA flanking T-DNA insertions of the 
Versailles mutant collection is currently under way. and 
understanding of these events will greatly benefit from 
this rich source of data. 

© Blackwcll Science Ltd, The Ptant Journal, (2001), 25, 31-41 
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The Arabidopsis radSQ mutant is hypersensitive to MMS 

In yeast cells, mutation of the RAO50 gene causes hyper- 
sensitivity to X-ray irradiation and to particular radio- 
mimetic DNA-damaging chemicals such as MMS. 
However, due to the cell-lethality of tadSO mutants in 



Heterozygotes Homozygoies 




Bgura 4. Tho m^50 mutam focus contains a ilngte T-DNA insertion. 
PCft of individual plants sea reflating Toted ON A was amplified 

with ,t pair of primers in the ftADSO sequence, spanninjj the T-DNA 
insertion sit 9 (S'-GAGCTGTGAAGCTACAAAGAATGAACTTGCa6<5TC; S'- 
CCCATCCAGCTTTGTAGTTGK Homozvgote plants show only tho 8 kb 
PGR product expected for a single-copy T-DNA insertion, while 
heterozygotes also show tho expected 1,5 kb tond for th« wild-type 
FAD5Qqcntt. 
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mouse, a clear demonstration of this radiosensitivity has 
not previously been possible in a metazoan organism (see 
Lug era/., 1999 for work on gamma-irradiated early mouse 
blastocyst explants). 

We and colleagues (J. Paszkowski. personal communi- 
cation) have observed that the I etna I effects of MMS on 
Arabidopsis seedlings are dependent on the size of the 
plants tested. Mon -ambiguous determination of the MMS 
concentration needed to kill a given plant line (or other 
multicellular organism) Is thus complicated by questions 
of the size of plants used. In order to avoid this problem, 
we therefore tested MMS sensitivity on cell lines derived 
from these plants. In this way, very small clumps 
(mfcrocalluscs) of growing cells are exposed to MMS, 
and a much clearer determination of dose-dependent 
MMS sensitivity is possible. Callus was induced on 
young leaf tissue, and suspension Cultures initiated using 
standard protocols (see Experimental procedures). 
Suspension cells in liquid culture were pipetted onto 
solid growth media with or without different con cent ra- 
tions of MMS, and their growth scored visually 2-3 weeks 
later (Figure 6). In the absence of MMS the rad50 mutant 
and heterozygote cell lines had both grown to similar 
extents, whilst the radSO mutant cells showed a clear 
hypersensitivity to MMS relative to the heterozygote cell 
line. We have also observed MMS sensitivity In mutant cell 
lines generated via the antisense approach (data not 
shown). These data suggest that the RadBO protein plays 
a role in DSB repair in plant cells, as is the case in yeast. 



(a) Right: border junction 

RAP50 AJUU3TTGIU5GACCgJlkCftCCT lg^ 

MUTANT GTCGClCCTGGCAClCGa^^ 

LB-RB CGGCACCrGGC ft CTVCtXXXrrG<^GTCa5<5CCCO i f ICCJ^TTCASG<^rUl\:rACGCTTAGATGA 

HB^LB n rn rvy« 1 fcfeaTTAAACG 



(b) Left border junction 

KtJTANT CACI&CAATCGATraCCTCATTT^ 
RB-LB GAGXAC&&TCGATT<7CCCTCATTTT^ 

RADSO i gagcaaatATTGAATATTGGTGGGATT^ 

1 nick sit* 



(C) Mutant protein 



IAC CGG AGA ACA AAA GCA AAA GTT GAG- GAG CTT ACA CGT 
ZRTTKAKVBEI.TR 



AAA AAT GAG GGG AAT CCA TTG TAG TGA ATCl 



aalOSO 



3 



figure Sequences of junctions of the inserted T-ONA. 

RAQ50 junevtons Ub) and the predicted effect on the RadSO protein (c), Tho mutant rad50 genomic DNA seauence is aligned with the sequences of the 
ends of the TOrVA used to generate (hp Arobicfopr.ix mutant^ and with the RADSO genomic sequence. RA05Q sequence deleted at the point of the 
fnsoriion is boxed (a) and the RB-L& sequence shown begins immediately nUnt the RB rt!clf (b). In frame codinrj ON A (upper* ond protein Mower) uequene* 
added to tho my mm rndSQ open reading frame by the insertion is shown boxed in tcK 

© Biockwcil Scionco Ltd, The Pisnt Journal, (2001), 25, 31-41 
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The Arabidopsis rad50 mutant is viable 

To ovr knowledge, the only other report of a radSO" mutant 
in a metazoari organism describes a 'cell-lethal' phenotype 
in a mouse rad5(T mutant (Luo el ah, 1399), These authors 
were unable to isolate homozygous radS0~ mutants of 
mouse ES cells in culture, and ascribed the early embry- 
onic lethality of homozygous radSQ/radSO mice to a 
gradual cession of proliferation of the embryonic cells. A 
differential radiosensitivity was also seen in mutant 
blastocyst ex plants, and the authors tentatively concluded 
that the cell lethality is caused by the accumulation of 
unrepaired and/or misrepeired DNA DSBs. indicating 
involvement of mouse Rad50 protein in DSB repair/ 
signalling. 

We show here that the Arabidopsis radSCT mutant is 
hypersensitive to the radiomimetic agent MMS, and thus 
presumably defective in DSB repair; however we have no 
evidence for a strong effect on the proliferative capacity of 
radSff Arabidopsis cells. Under certain growth conditions 
the mutant plant is stunted, but the facultative nature of 
this phenotype clearly indicates the lack of a baste problem 
at the cell division level. As the mutant plant is fully sterile, 
effects on cellular proliferative capacity can only be 
studied in the context of a single plant generation. Thus 
we have derived mitotic cell cultures from the mutant 
plants in order to be able to study long-term effects. These 
cells grow, and we are undertaking further analysis on the 
involvement of the RADSO function in ON A repair/recom- 
bination/cell proliferation in plant cells. 

As the radSO mutation in our mutant Is caused by a DIVA 
insertion into the coding sequence of the AtRADSO gene, 
we cannot exclude that a truncated protein is produced in 
the mutant cells, which would thus not be true 'null' 
mutants. Preliminary data using antibodies to a C-terminal 
fragment of the AtRadSO protein shows the absence of the 
AtRadBO protein band on Western Wots of protein from the 
mutant cells (S. Daoudal and CI.W., unpublished results!; 
however these antibodies may not recognize a C-terminus- 
truncated form of the protein. It is thus formally possible 
that this putative truncated protein is sufficient to fulfil the 
function of the RadSO protein in cell viability. However, 
given the clear phenoiypes of the mutant for meiosis and 
MMS sensitivity, we think it extremely unlikely that the 
viability of the radSO" mutant cells and plants can be 
explained in this way. We note that the question of 
whether a given mutation is null may be posed for any 
insertional mutation that does not involve full deletion of 
the open reading frame of the gene in question (i.e. 
including all insertional mutations). Whether or not our 
mutant is 'null' does not alter our conclusions on the role 
of the AtRadSO protein in Arabidopsis and the phenotypes 
of the T-DNA insertion mutant described here. Full reso- 



lution of this question must await the identification of a full 
AtradSQ deletion mutant 

In conclusion, the plant Rad50 function appears to be 
analogous to that of yeast- Given the difficulties encoun- 
tered in isolating and studying the mouse rad50~ mutant, 
we believe that further study of radSO" mutant plant cells 
will permit us to clarify the role of this polyvalent protein 
and its multiple roles in the maintenance of genome 
integrity in both plants and animals. 

Experimental procedures 

Growth of eel! suspensions and plants 

The Arabidopsis thnitano cell suspension (TB7) was established 
by Axelos etaL (1992). Cells were grown in Garnborg's B-5 
medium (Sigma #G5893, St Quentin Fallavier, France) supple- 
mented with 30 g l -t sucrose and 200 mg l~ T naphthalene acetic 
acid (NAA) on a rotating platform (120 rpm) at 22°C with 16 h 
light/8 h dark. Cells in liquid culture were subculturcd at weekly 
intervals. Bacto-agar {Difco, Detroit, Ml, USA) was included in the 
cell-suspension medium at 0.8% w/v for solid media. Arabidopsis 
thiiiiom> (Columbia) seeds were sown directly Into damp compost 
and rrn inated in a greenhouse under while light (16 h light/8 h 
dark). For the isolation of root tissue, plants were grown 
aeroponically under the Same Conditions. 

Callus cultures were derived from homozygous radSO/radSO 
and heterozygous radSO/RAOSO plants using standard techniques 
as follows (J. Lucht and B. Hohn, personal communication). 
Leaves of young germinating plants were surface-sterilized in 
0.5% sodium hypochlorite, 0.05% Twccn-80 for 10 mirvat RT and 
rinsed several times with sterile water. The leaves were then cut 
up with a sterile scalpel and placed on callus induction medium 
(CIMk agar for 1 Week (22°C, 16 h tight). The lea&allus was then 
transferred to shoot induction modi urn (SIM) agar medium, and 
after 2-3 weeks successfully growing green callus was transferred 
to freah SIM medium (either solid or liquid) and Then maintained 
on this medium by regular subculture. C1M and SIM media were 
prepared as for the Garnborg's B-5 medium (above), except that 
the hormones (Sigma) differ CIM, 1 mg T 1 2,4-dichlorophenox- 
y acetic acid, 0.2 mg T 1 kinetin; SIM, 0.1 mg T 1 NAA. 1 mg I -1 6- 
bchzyliiminopufihc. 



Isolation of the AtRADSO cDNA and genomic DNA clones 

Using poly(A)+ RNA prepared from the cell-suspension culture of 
Arabidopsis, the complete AtftAOSO cDNA was isolated using tha 
marathon RACE PCR kit following the manufacturer's instructions 
(Clonrech). 

The genomic DNA clone was isolated from a genomic DNA 
library constructed with Sau3A1 partially digested DNA from tha 
cell suspension cloned into lambda F1XII (Stratagene Inc., La Jolla, 
CA, USA) following the manufacturer's protocol. 



Southern analysis 

Genomic DNA was isolated from eel (-suspension cultures follow* 
ing the method of Da II a porta era/. (1933). DNA (3ug) was 
digested with 50 units of the relevant restriction enzyme in a 
volume of 100 for 18 h at the recommended temperature. 

© Blackwell Science Ltd, The Plant Journal, (2001), 25, 31-41 
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Digested samples were phenol/chloroform extracted, ethanol 
precipitated, recuspended in TE and electroDhoreaod in 0.8% 
agarose/TAE gels. The gels were capillary blotted to Hybond N+ 
(Amersham, Orsay, France) positively charged nylon membrane, 
And hybridrred at G2"C to rad inactively labelled DNA probes 
according to Church and Gilbert {1984}. Filters were then washed 
(0.1 x SSC, 0.1% SDS, KfC) and outorodiogrophed. Probes were 
labelled with tx-^P dCTP using the Prime-it II Jdi (Suatagene) 
according to the manufacturer's instructions. 



RNA isolation and Northern analysis 

Frozen plant or call-culture tissue was homogenized in liquid N z 
with a mortar and pestle. Total RNA was then prepared using the 
TrizoJ reagent following the manufacturer's instructions (Gfbco- 
BRL, Cergy Pontoise, France). Poly(A)+ RNA was purified from 
total RNA using ths mftNA Direct kit {Dynal Inc., Campjegne, 
France). For Northern analysis, 30 jjl^ RNA per lane was fractio- 
nated on 0.8% agarose/formaldohydo gels, which wore blotted, 
hybridized to radioactivery labelled probes, and autoradiographed 
as for the Southern blots (above). 



Callus MNtS sensitivity tests 

OS ml of suspension cells were pipetted onto the surface of agar 
plates containing solid SIM medium and different concentrations 
of MMS (Sigma #M4016}. The plates were then incubated as 
described above, and resistance or sensitivity was scored visually 
2-3 weelcs later. MMS-containing plates were prepared immedi- 
ately before use. All MMS-contaminated material was quenched 
after use by soaking in 10% w/v sodium thiosulphatc. 
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APPEKWX D - motifs from Example 4, Pfa» added to Appendix A (2/14/02) 

Docket 1116K 

Serial Numb r 09/538,396 

1 !AA MULTIPLE ALIGNMENT 1.0 PileUP , . . ft ^ 

symbol comparison table: genrundata :blosum62 . cmp CompCneck: 1102 
Gapweigbt: 8 GapLengthWeignt ; 2 . . _ ia 

MSF: 1380 Type: P February 11, 2002 15:38 Check: 219 



Human RadSO 
Human RadSO 
Rat RadSO 
Mouse RadSO 
Arabidopsis RadSO 
Arabidopsis RadSO 
SKQ ID NO: 2 Maize RadSO 
Yeast RadSO 
C- elegans RadSO 




U63l39aa 
Z7S311aa 
NM022246aa 
U66887aa 
AAD15407aa 
AF168748aa 
1116 sid2 
X14 8l4aa 
Z75312aa 

FORMATTING; 

MOTIFS: Identified in Example 4 and Pfam analysis 
leucine zipper - Underlined, LZ noted in left margin 

Nucleotide binding motif - Italicized & underlined. NT in left margin 
Nuclear localisation signal - boxed, NLS in left margin 
RadSO Zn-Hook - Double-underlined, ZN noted in left margin 

i 

U63l39aa — 

Z7S311aa MLIFSVRDMF 
NM022246aa 

U66887aa 
AAD15407aa 

AFl687i8aa nt 

X14814aa 

Z75312aa — 1BKF 




US3139aa 

Z75311aa 
NM022246aa 

U668 87aa 
AADl5407aa 

| | || || 

1116 Bid2 

Xi48i4aa 
Z7S312aa 



1 
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APPENDIX D - motifs from Example 4, Pfam added to Appendix A (2/14/02) 
Docket 1116K 

Serial Number 09/538,396 



U63139aa 
Z75311aa 
NM022246aa 
US6887aa 
AAO154 07aa 
AFl68748aa 
1116 Sid2 
X14 814aa 
Z75312aa 




151 



200 



U63139aa 
Z753llaa 
NM022246aa 
U66887aa 
AADl54 07aa 
A*168748aa 
1116 sid2 
X14814aa 
2753l2aa 



U63139aa 
Z75311aa 
UlM022246aa 
U66887aa 
AAD15407aa 
AFlS8748aa 
1116 sid2 
Xl4 8l4aa 
Z75312aa 




LZ, NLS 



ISKRgEET 



251 

U63139aa 
Z75311aa KSYEU: 

NM022245aa KAYEU 

U66837aa 
AAD15407aa 
AFl68748aa 

1116 3 id2 

X14814aa 

Z75312aa 



300 




LIRV 



301 



350 



U63139aa 
Z7S311aa 
NM022246aa 
U66887aa 
AAD15407aa 
AF168743aa 
1116 sid2 
Xl48l4aa 
Z75312aa 




LZ 



2 
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APPENDIX D - motifs from Example 4, Pfam added to Appendix A (2/14/02) 

Docket 11X6E 

S rial Number 09/538,396 



351 
U63l39aa 
Z7531laa 
NM022246aa 
U6G887aa 
AADl5407aa 
AF168748aa 
1116 Sid2 
Xi48i4aa 
Z75312aa 



U6313 9aa 
Z75311aa 
NM022246aa 
U66887aa 
AAD154 07aa 
AFl68748aa 
1116 sid2 
Xl48l4aa 
Z75312aa 



U63139aa 
Z75311aa 

NM022246aa 
U66887aa 
AAD15407aa 
AF168748aa 
1116sid2 
Xl4814aa 
Z75312aa 



U63139aa 
Z7531laa 

NM022246aa 
U66887aa 
AAD15407aa 
AF168748aa 
1116si02 
X14 814aa 
Z75312aa 



U63139aa 

Z753Haa 
NM022246aa 

U668B7aa 
AADi5407aa ag< 
AP168748aa ao: 

1116 Sid2 

X14814aa 

Z75312aa KQ..TL; 



400 




3 
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APPENDIX D - motifs from 

Docket 11X6E 

Serial Number 09/538,396 



Example 4, Pfam added to Appendix A (2/14/02) 



601 



650 



U63139aa 
27531laa 
KM022246aa 
U66887aa 
AADl5407aa 
AFl68748aa 
1116 sid2 
xi4B14aa 
Z75312aa 



U63139aa 
Z75311aa 
NM022246aa 
U66887aa 
AAD15407aa 
AF168748aa 
1116 3 ld2 
Xi48i4aa 

Z75312aa 



U63 139aa 

Z7531laa 
HMQ22246aa 
U66887aa 
AAD15407aa 
AF16S743aa 
1116 si d2 
X14814aa 
Z753l2aa 




, JUDLEMD Fi 

flgEEDLRO iQLNV 






LFDVCGSQDF 
|K XiFDVCGSQOF 

k lfdvcgsqdf 
Ik lfdvcgsqdl 



lz, 2m 



QDNQ .|SK|K vWli-SENL 



701 



H 

LAPLSAKS 
750 



ilTQL 
ilTQL 




TD. . ENQS 
TD . , BNQS 
TD. . ENQS 
TD. .ENQS 1 



QTTLE_ 
SlgYDSYI. 




ZN 



LEIAERDS 
EESKSSG< 



751 
U63139aa 
Z75311aa 
NM022246aa 
U66887aa 
AAD15407aa 
AFl68748aa 
1116 Si d2 
X14814aa 
Z75312aa 



U63139aa 
Z75311aa 
NM022246aa 
U66887aa 
AAD15407aa 
AFl68748aa 
1116 si d2 
X14 814aa 
Z75312aa 



800 




ZN, LZ 



'DVGVgQQIi YEQTEEWBSR 



4 
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APPENDIX D - motifs from Example 4, Pfam added to Appendix A (2/14/02) 

Docket 1116B 

Serial Number 09/538,396 



US3139aa 
Z753Haa 
NM022246aa 
U668 87aa 
AAD15407aa 
AF168748aa 
1116 sid2 
Xl48l4aa 
Z75312aa 



U63139aa 
Z75311aa 
HM022246aa 
U66887aa 
AADl5407aa 
AF168748aa 
1116 S±d2 
X14814aa 
2753 12aa 




901 



REI/RK TjgSgliQMEKD 
EiYRKfV QEC^ELQKCS 

950 





qhlkstt n! 
Iqhlkstt ne 
hl: 

Icll 



RENSRMI 

KLgsfL NEXiGT: 



951 
u63139aa 
Z75311aa 
NM022246aa 
U66887aa 
AAD154 07aa 
AFl68748aa 
1116 si 02 
X14 814aa 
Z75312aa 



U63139aa 

Z75311aa 
NM022246aa 

U66887aa 
AAB15407aa 
AF168748aa 

1116 sid2 

X14814aa 
Z75312aa 



U63139aa 
Z75311aa 
KM022246aa 
U66887aa 
AADlS407aa 
APl6874 8aa 
1116 Sid2 
X14814aa 
Z75312aa 




1051 




1050 



EN YTQDG] 
EN HIQDG1 
|EH YIQDG1 

|en yiqdg: 
d eft 

E - . 



FEAKG 




1100 




LZ 



5 
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APPEND XX D - motifs from Example 4, Mam added to Appendix A (2/14/02) 

Docket 1116B 

Serial Number 09/538,396 



U63X39aa 
Z753llaa 
NM022246aa 
U66887aa 
AAD15407aa 
AFl68748aa 
1116si<12 
X148X4aa 
Z753l2aa 



1101 




1151 

U63139aa 
Z75311aa 
NM022245aa 
U66887aa 
AAD15407aa 
AF168748aa 
1116 sid2 
Xl4814aa 
Z753i2aa 



U63139aa 
Z75311aa 
NM022246aa 
U6€887aa 

AAD15407aa 
AFl68748aa 
1116 sid2 
X14 3l4aa 
Z75312aa 



U63139aa 
Z75311aa 
NM022246aa 
U66887aa 
AAD15407aa 
AF168748aa 
1116sid2 
Xl4814aa 
Z753l2aa 



U63139aa 

Z7S311aa 
NM022246aa 

U66887aa 
AAD15407aa 
AF168748aa 

1116 a id2 

Xi48l4aa 

Z75312aa 



1200 




x-z, NT 



6 
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APPENDIX d - motifs from Example 4, Pfam added to Appendix A (2/14/02) 
Docket 1116B 

Serial Number 09/538,396 

1351 1360 

U6313*aa Hk§NIDSC| *||VKCSjSSI* GFNVH- 

Z753iiaa H^^cWfl e|vkcs|ssl GFNVH- 

KM022246aa 11^1^1 E|VTCSS|NSIi GSYVH- 

U6G887aa IlKftMlJIdl EgVKCSlSSL GSYVH- 
AAJD15407aa 
APl68748aa 
1116 Sid2 

Xi48i4aa HkHOrIkI opwv^NRV TY 

Z75312aa CLgBBhGiB FBSKRYPDGT VKRVNTKRRP 




7 
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bioSCOUT 



i::Heip; 



Analysis Browser: ^l^eyelUp; 



1116E .r ad5Q.sid2 (Protein) 



Update 



Description 



1116E rad50 sid2 



■Edit 



Function DNA repair protein RAD50 (153 kDa protein). 

'V 

Direct assignment of functionality by homology to 

^$.$|P12753iRA.i0JfEAST 

in region 1 to 1314 for overall length of 350 (99% of query, 375% of hit, see the 
^Hgnrnent ). 

Functional class Transcription 



Extracted keywords 



iding. Coiled coil 



Features 
Ruromarv 



KlM'//AoWm Aiinnrst ™rn.«1 HI m" _Ki*r. /Ki r><Z POT TT/RFI? V/analv<li^?aiH^Q7d? ^1/2003 
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Homologies 



General 



query 

prosearch 

pfamsearch 

blocks 
seg 

phd-sec 

phd-acc 

coils 

biasdb 

function 

3D-structur& 



451 



902 



—m^l j! — lllliiil 



11 1 mi! 1 HM-HBHWH — H ! \l I !L L 



i in ■ 



Protein 

ESTs 
Patents 

Gene name 

Molecular weight 

Sequence length 

Isoelectric point 

Predicted cellular 
localisation (PHD 
and PreLoc) 



30 clear homologs 

170 homologs 
53 homologs 



152.50 kD 

1316 

6.04 

nuclear (94.7 %\ 



AlLBLASXWis 

AlLproiein BLAST 
hits 

AH .ESI 
All pa tent hits 



3D Structure 3D structure inferred by unlikely homology from residues 6 to 48 in 1E69-A 



Kttn.//«WO? riimnnt rnm«ftlA1 ai -hi n /hi a Sr.Ot IT/SRR V/analvsis?airi=9?.42 5/21/2003 
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View alignment 
P dbj1E 69[lE69-A structure 



Phytogeny 



Features 



Patterns 



Distribution 
Taxa 

M odel organisms 



Coiled coil region 



Low complexity 
region 

E rich region 

No significant hits 
detected by 

SMC family, C- 
teroiiual domain - 
region 

ABC transporter - 
region 

Intermediate 
filament protein - 
region 

Myosin tail - region 

Uncharactcrized 
ACR, COG1579 - 
region 

KE2 family protein - 
region 

Rad50 zinc hook 
motif region 

Protein of unknown 



34 species extracted from 175 Species 
homologous sequences. 

Chordata, Eukaryotae, Fungi, 
Planta 

Arabidopsis thaltana, 
Caenorhahditis elegans, 
Drosophila melanogaster, 
Homo sapiens, Mus musculus, 
Saccharomyces cerevisiae 

\ 

from 311 to 336, from 341 to 361, from 517 to 538, 
from 541 to 569, from 592 to 633, from 727 to 747, 
from 844 to 867, from 903 to 928, from 930 to 969, 
from 1036 to 1066 detected by {T_oilsl 

from 296 to 307, from 456 to 469, from 902 to 915 
detected by [segj 

from 212 to 964 detected by fbtasdbl 



from residue 1100 to 1314. Source: [pfa msearch ] 
Quality: (E-0.52) 

from residue 1141 to 1296. Source: [pfa msearch] 
Quality: (E-0.19) 

from residue 840 to 1118- Source: fptamsearch] 



from residue 237 to 1075. Source: {^famsearch] 
from residue 804 to 1015. Source: [pfa msearch] 

from residue 891 to 983* Source: [ pfamsear ch] 

from residue 673 to 726. Source: [pl^seatcJhL] . 
Quality: (E=9,9e-06) 

from residue 441 to 717. Source: [pfa msearch] 
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function, DUF2S9 - 
region 

Late embryogencsls from residue 548 to 625. Source: [pfamsearchj 
abundant (LEA) 
group - region 

RadSO zinc hook from residue 360 to 413* Source: [pfamsearch] 
motif region 

Heat shock protein from residue 219 to 275. Source: [ pfams earch] 
9 / 12 - region 

Poly(A) polymerase from residue 75 to 273. Source: [iifam search) 

central domain - 

region 

Sigma-70, won- from residue 112 to 260. Source: [pfams earch ] 

essential region - 

region 

RecF/RecN/SMC N from residue 2 to 182* Source: [pfamsearch] . Quality: 

terminal domain - (E=0.48) 

region 

LEUCINE_ZIPPER from residue 922 to 944. Source: [prosite] 
region 

from 915 to 937. Source: [prosite] 

from 908 to 930. Source: [prosite] 

from 849 to 871. Source: [prosite] 

ATP_GTP_A region from residue 34 to 42, Source: [prosite] 

No significant hits [blo cks database ] 
found in 

Comment No comment section. i^ffift:.:) 

Completed Task* StartXiol e User Comment Output l! nt ? ractiV( 

04.04.2003, dressvm bioSCOUT_default details^ 

16:44:19 - 

Permissions lii-ffffi- 
Alert Jobs Mf&i 
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v:-L : Projepts ! :;::i} ^Analyses , ; | ^ Admjh ^ | ^Atert.^| a£ttS^| . ; :;j.:^u^pja^ff 



Alignment: 1 1 16E,rad50.sid2 - pdb|lE69|lE69-A 



BLAST? - alignment of 1116E.rad50.sid2 against pdb|lE69|lE69-A 

chromosome segregation smc proteinfragment: smc fusion of the n- and c-terminal globular domains 
residues 1-152 and 1023-1 164; 

• This hit is scoring at : 0.56 (expectation value) 

• Alignment length (overlap) : 43 

• Identities : 44 % A 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

• Database searched : nrdb 

Q: 6 KML1KGXRSFDPDNKNVITFFK?LTLIVGPNGAGKTTIIECLK 48 

K:.:KG.:SF, F . : T . IVGPNG : GK: . II ; . : K 

H: 5 KLYLKGFKS FG — RPSLIGFS DRVTAIVGPNGSGKSNI I DAIK 45 



Legend of Alignment 

; positive score 

score between -2 and 0 



i 



bioSCOUT' 



Please report problems and feedback concerning bioSCOUT through the support interface 
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HEADER CHROMOSOME SEGREGATION 09-AUG-OO 1E69 

TITLE SMC HEAD DOMAIN FROM THERMOTOGA MARITIMA 

COMPND MOL_ID: 1; 

COMPND 2 MOLECULE: CHROMOSOME SEGREGATION SMC PROTEIN; 

COMPND 3 CHAIN: A, B, C, D, E, F; 

COMPND 4 FRAGMENT: SMC FUSION OF THE N- AND C -TERMINAL GLOBULAR 

COMPND 5 DOMAINS RESIDUES 1-152 and 1023-1164; ' 

COMPND 6 ENGINEERED: YES 
SOURCE MOL_ID: 1; 

SOURCE 2 ORGANISM_5CIENTIFIC: THERMOTOGA MARITIMA; 

SOURCE 3 ATCC: DSM3109; 

SOURCE 4 CELLULAR_LOCATION: CYTOSOL; 

SOURCE 5 £XPRESSION_SYSTEM: ESCHERICHIA COLI; 

SOURCE 6 EXPRESS I0N_SYSTEM_5 TRAIN : C4l{DE3); 

SOURCE 7 £XPRESSION_SYSTEM^CELLULAR_LOCATION: CYTOSOL; 

SOURCE a EXPRESS ION_SYSTEM^VECTOR_TYPE: PLASMID; 

SOURCE 9 EXPRESS ION_SYSTEM PLASMID: PH1S17 

KEYWDS SMC, STRUCTURAL MAINTENANCE OF CHROMOSOMES, COILED COIL 
EXPDTA X-RAY DIFFRACTION * 
AUTHOR J • LOWE, S - C - CORDELL, F - VAN DEN ENT 

REVDAT 2 27-MAR-01 1E69 1 JRNL 

REVDAT 1 09-AUG-OO 1E69 0 

JRNL AUTH J - LOWE, S. C. CORDELL, F. VAN DEN ENT 

JRNL TITL CRYSTAL STRUCTURE OF THE SMC HEAD DOMAIN: AN ABC 

JRNL TITL 2 AT PAS E WITH 900 RESIDUES ANT I PARALLEL COILED-COIL 

JRNL TITL 3 INSERTED 

JRNL REF J.MOL.BIOL- V. 306 25 2001 

JRNL RE FN ASTM JMOBAK UK ISSN 0022-2836 
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bioSCOUT 



■yf?:.Pr.bje?^l: : ^ | : ; >: Analyses. if\ 


Vdmin V^j .Alert .'■ | 








: "Help : : 






:?n: : Features 



Summary 

Searched query 1116E.rad50*sid2 against PFAM database. 



Hit 



pfem|hmm|Rad50^„hpj)JL. 
alignment 

r 

pfam|hmm| Rad5Q zd hook ■ 
aligT)inent 

C pfam|hmm|H SP9 HSP12 
alignment 



* j^X^ti.i.^ | i.Ai..fcAAAi.|i-fi^K it X 

alignme nt 



f" pfam|h mni|KE2 . 
alignment 

V pfam|hmm |PAP cent ral . 
3tigranent 

f~ pfam|hmm|ABC tran . 
alignment 

C pfam l hmmlSMC ^N , 

T p famlIimm|PUF]t64 . 
alignmeiU 

r pfamlhniiti|JSMCLCL. 
alignment 

I~ I^rn|hmm|sa 
al ignment 



Score Expect Description 



32.5 9.9e- Rad50 2inc hook 
06 motif 



4.2 1.8 Rad50 zinc hook 
motif 



Q Q to Method 
from 

673 726 HMMPFAM 



360 413 HMMPFAM 



•11.4 4.1 



-19.0 5.6 



Heat shock protein 9 / 219 275 HMMPFAM 
12- 

Late emhryogenests 548 625 HMMPFAM 
abundant (LEA) 
group - 

KE2 family protein - 



891 983 HMMPFAM 



-45.9 7.6 Poly(A) polymerase 75 273 HMMPFAM 
central domain - 

-47.7 0.19 ABC transporter - 1141 1296 HMMPFAM 



-69.5 0.48 RecF/RecN/SMC N 
terminal domain - 

-90.1 2.8 Uncharacterized 
ACR, COG1579- 

0-52 SMC family, C- 
116.7 terminal domain - 

4.9 Sigma-70, non- 
122 2 essential region * 



2 182 HMMPFAM 



804 1015 HMMPFAM 



1100 1314 HMMPFAM 



112 260 HMMPFAM 
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bioSCOUT 1.5.3 - 1 1 16E.rad50.sid2 Pa S e 2 oF2 

r pfm|femm]nUF.252_. - 9.3 Protein of unknown 441 717 HMMPFAM 
ali gnment 137.1 function, DUF25 9 - 

r pfamlhmm | filament . - 5.4 Intemiediate filament 840 1118 HMMPFAM 
alignment 203.9 protein - 

r pfamjhm m|Mvosin t ail . - 9.8 Myosin tail- 237 1075 HMMPFAM 



Please report problems and feedback concerning bioSCOUT through the support interface,.. 
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IP bioSCOUT 0 



•;y.'; : F : eatureg;; 

Aligranent: 1 1 1 6E.rad50,sid2 - pfam|hmrn|Rad50_zn Jiook 



HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|Rad50_zn_hook 

Rad50 zinc hook motif 

• This hit is scoring at : 4.2 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 360 AHLTMKHERDSDIKNICTKHNL~GPVPEHPF-TNDVAMNLTNR%KARLSSLENDLL 4 13 

L_. : PV T . : . . .L. . » * K: . L . . L. . : L 

H: 1 galesekaelkkaieGleeeesscCPvCgReLgteeekkslikeykseldr-lpeelk: 57 



HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|Rad50_znJiook 

Rad50 zinc hook motif 

• This hit is scoring at : 32.5 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 673 NFANGMREMLAPFEHLARKNHV — C PCC ERAF- T PDEE DE FVKKQRMQNS S TAERS K 726 

. ... : . L CP.C R. - T . : E: . E . : K: . : . : K 

H : 1 galesekaelkkaieeleeeesscCPvCgReLgteeekkelikeyks<aldrlpeelk 57 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface^ 
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fe : pro|ects: ; : : ;| .^-Analyses ;::: : ;| g^dfnin^} Alert ! ;i ; : l^RS^ f^Tutorrai;; 



Alignment: 1 1 16E.rad50.sid2 - pfamjhmTn!HSP9_HSP12 

HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|HSP9_HSP12 

Heat shock protein 9 / 12 - 

♦ This hit is scoring at : -1 1 .4 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

q. 219 — IX2AKKLRENIAQDQEKSDASKSQMEQLKEKICGTERF.ILQME 1 3SLDELRRLQGQIDI 27 S 

D A.K . A; : : . . D:SKS. . EQ : KEK: * * - : : - - TS D: . . :Q . D 
H: 1 MSDlaKKdFgeKakEklTPOSsKSTaEqvI^kvTDklDkvAgkvtsdddKStvQkAhDk 59 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface. 
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bioSCOUT 



Alignment: 1 1 l6E.rad50.sid2 - pfamjhmm|LEA_l 



HMMPFAM - alignment of Hl6E.rad50.sid2 against pfam|hmm|LEA_l 

Late embryogenesis abundant (LEA) group - 

• This hit is scoring at : -14,3 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 548 LESS KDKLNE I VNEHKDKI KKVLrCf r np FEKDMKKE 1 NQAFW pVfrKEYNELRSKSQEAEQ 
::S:K:K::.:.:. K: K: . . :.DK.E .A . :KE- - R . K ; ; EA : . 

H: 1 MqSaKEKisnmAstAKekmditK AkadEKaEkatARtkeEkGilAhqrkkAKeAgA 

ELKFTQSKVTDAREOLTK 625 
E: . . : : K. . . A. E: : . 
eMdlheaKAehaaekesa 1 3 



Legend of Alignment 

: positive score 

score between -2 and 0 



.Help; 



'-Futures.: ;>j 



Please report problems and feedback concerning bioSCOUT through the su pport interfa ce. 
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;:; : ;h^ip;; 



Alignment: 1 1 16E.rad50.sid2 - pfam|hmm|KE2 



HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm]KE2 

KE2 family protein - 

• This hit is scoring at : -19.0 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 891 ASSILERFQKSEEELVLLiAEEKEQLIVEKKI-LEESLDPLSKEKfiS LL-Q 

.,.;L.:.Q: :::L :.»:K.QL : K . E L:.L.K . E . : Q 

H : 1 vqellaklqqlqqqlekvmtqJcaqlerqLkEaelvleELekldeDCkVYkXVGkvLVkvq 

EYNALKQKLDEEYHQLAERKREFQQELOALGR LNMKIKGYLDSKK 983 

: . . . ; . ;L--E- - -QL.E. ; L : L. K: : .L.S.. 

dkeeardeLeerleqleeeiktLekqccylekeleeleekXqellqsaa 109 



•Features' 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the sup port interface. 
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bioSCOUT' 



^Projects : ;| J^Araly&s 










S:Help:. 






::V:: : :rFe^tU^ : ;^;' 



Alignment: 1116E.rad50.sid2 - pfam|hmm|PAP_ceritral 



HMMPFAM - alignment of Hl6E.rad50.sid2 against pfam|hmm|PAP_centraI 

Poly(A) polymerase central domain - 

• This hit is scoring at : -45.9 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 7 5 xsNWPLQDPSTLEOCKFDDTFSAT-RYTKALEVTKK-LftKDQMQEIKTFRLK 

:P . PSTL ;KF : FS . R: . . .V: K ::,D,::E 
H : 1 RFE , radP . 1 YPnavpst lvekf Flvf sqWlrhnwpoPVlLkeinsdsieernlqvrvRF 

273 

E.rade 205 



Legend of Alignment 

; positive score 

score between -2 and 0 

Please report problems and feedback concerning bioSCOUT through the support interface. 
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bioSCOUT 



:=He!pv 



^:!:. : ;::FeatLireS-' 

Alignment: 1 1 1 6E.rad5(Xsid2 - pfam|hmm| ABCjran 



HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|ABC_tran 

ABC transporter - 

• This hit is scoring at ; -47,7 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 1141 ■ TALDKALMRFHS MKM-EEINKII-* KELWQ 

H : 1 GevlalvGpNGaGKSTLLklisGllppteGt illciGardlsdlsklkerlerlrknigw 

Q T y RGQDIDVISINSOS EGAGTRSY ►--SYRVVMQTGDA--ELEMR- 

. . : - • R» : . : 2 : .G . .L R 

fQdptlfpnpeltvreniafglrlslglskdeqrdrlkkagaeellierlglgydhlldrr 

-GRCSAGQKVlasliiRLALAETFCLNCGILALDEPTTNLDgpnaF.3I.AAALLRIMEARK 

G..S.GQK R: A: A : L . LDEPT . . LD .: .A. LL, : ; » . : 

pgtLSGGqkQ RvaiARaLltkpklLlLDEFTagLD - - . . pagraqllellrelr 

GQeNFQLIVITHDER^-AHLIGQRQLAEKY 1296 

q , , : : : TTMH 1 . . 1 . . : ; . E , 
qq.ggTvllitHdldlldrlaDrilvledG 199 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface. 
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|af^6jQfite" : : i:| ^;: An?'yses ■■:; ( jjfodfori:. -j-J :rAlert ;;| ^SRSl.I ■ tutorial j 



Features 



Alignment: 1 U6E.rad50.sid2 - pfam|hmm|SMC_N 



HMMPFAM - alignment of Hl6E.rad50.sid2 against pfam|hmm|SMC_N 

RecF/RecN/SMC N terminal domain - 

• This hit is scoring at : -69.5 

* Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 2 STVDKMLIKGIRSFDpdNKNVl-TFFKPLTLIVGPNGAGKTTIliCLKL SCT-GEL 

. ::G.:S: .K-VI . F . -T . IVGPNG :GK: . I ; ; . : - . : :L 

H: 1 mylkrielegFKSYa. . gktvigpF.spgFtarvGPNGSGKSNilDAIlFVLGegrsa^lcL 

PPNSRSGHTFVH—DPKVAGeteTKGQIKLRffKTAAGKDW CIRSFQLTQKASKME 

. , . S A. 4 Fi ■ . : : : IR, ■ L . : : - : : . 

RaerladLIhkgsggkppan- . . ksAeVtit Fdnedkeniselgqhirdgpldeenpevt 

FKAIESV LQT 1 NCHTCEK VCLSYRCADMDREIPALMGVSKAV 

... ; I N .. .K V . . .A. : D £: . .L 

ItRrvyrlglgdGstSeYyiWknlrlNgkrvtklkevqeiLesagldiElatLangvsky 



RayaWCFHWDkELiLleqlswRaeYeaLapeiETCqlFLPElLtvSFgrGWeKetdYaEv 

. — LENVIFVHQDE 

: Q.E- 

LarnlTEUDkqLgYTqGrqKADLRl RAn^lPVEDvLSRGQLKLknpyf iilQG^yltrck^ 

SNWPLQDPSTLK KKFDDIF-S 132 

rhClYLvedlasmkPkeRreldeGLlellEEisGt 330 



Legend of Alignment 

: positive score 

score between -2 and 0 

Please report problems and feedback concerning bioSCOUT through the support interface* 
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■ .Features'" 

Alignment: 1 1 1 6E.rad50.sid2 - pfam|hmm|DUF 1 64 

HMMPFAM - alignment of ni6E.rad50.sid2 againstf pfam|Ihmm|BUF164 
Uncharacterized ACR 7 COG1579 - 
* This hit is scoring at : -90. 1 

o Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 804 PTDTIDrHVI-lEIQQLVKEVEDLEm.DSSgrgVKSL£FJQLELNt ft LQRTRDTLIV£VDDL 
.: :1Q: : KE E IE . K-L::.: L. L : : . : .1* B ::L 
H: 1 mknelk-sLvkiqeidkekerLeerikei. . .pkeL-kkakellealkkeveeleqekeel 

RDQHRMLNEDMSSAQVRWHNArEEKVKA SSILERFQKSEEELVLIAEEKEQLI 

::: ;.L. : ..A EEK; . - . - .QK: : : : V.L.:E EQL 

keevkklekeiqeieekikka.EekmdeiktqrEYkALerElqkakdkevcirkeieqle 

VEKKLLEESLDPLSKEKESLLQEYNALKQECLDEEYHQLAERKREFQQELDalgRLNMKIK 
E K :EE.::.L.:E . :E... £..::-£: E. .: : -L- K. . 

eelkkieeeiaelkeeilkqEkeleeeeeevelEvrkikekvlell3lcre . . - elkektd 

GYLDSKKNEKLKELQGRHVL CH— — SQLQSCMAKQ-QRIS — — 

to - v ' • ^ cn . ; . ivr ; t t 

edllsfYERiiknkknlvxVPiennvCaGChiiljpsqfenkVrkePddivfCPyCSRILY 

AEL 1015 
E 

yee 235 



Legend of Alignmeot 
: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface. 
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mm. 



bioSCOUT 





j v^Alirt:;! > : :;SRS;. 








:I>::l:eatur^':>;:: 



Alignment: 11 16E.rad50.sid2 - pfam|hmm|SMC_C 



HMMPFAM - alignment of 1116E.rad50.5id2 against pfam|hmm|SMC_C 

SMC family, C -terminal domain - 

* This hit is scoring at : -1 1 6.7 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 1100 ISKHKQELKLS «~QYKDIEKRYTNQFLQ^KTTEMflNECDLDRYYTALDKAL 

;.K. : EL :Y.::E:RY,- . :L: . .E . . K. L LDK. 

H: 1 lelceikeLgpVlnElqvNlkAieEYaeveeRyselvekledleeerkJcllevieeldkkr 

MRFHSMKMEEINKIIKELWQQ-TyrgqdidyisinsclsegagtrsysyrvvraQTGDAELE 

; : INK . KE: :Q: T GDAEL. 

leeFmeaf nklnknf kevFqeLt gGGdAeLr 

MRG RCSAGQKVLAS LIIRL ALAET FCLNCG ILALDEPT 

:.. S:G : : A. : .L ALA . F- : : ...LDE . 

LtDpdDPFssGieisArPPgKkwknlelLSGGEKtLtALALlFAIhkykPsPFYvlDEvD 

TNLDGPNAeSLAAALLRimeaRKGQENFQLIVlTHDERFAhligqrQLAEKYYRVSKDEN 
r i^v vf c 7X i ; p.; IVT : .... : A; . . . V , . : : 

AALDeaNV.sRvAnylk. , , , rersknaQFIVIsLRnnmm , ... * . ekADaLvGVymqdd 

QHSIIESQEI 1314 

S : S : : 
gv3kVislkL 210 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the su pport interf ace. 
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■SSI 



bioSCOUT' 



• Help- 
Features 

Alignment: 1116E.rad50.sid2 -pfam|hmm|sigma70_ner 

HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|sigma70_ner 

Sigma-70, non-essential region - 

• This hit is scoring at : -122.2 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 112 AIESVLQTINPHTGEKVCLSYR CADMDREIPALMG *VSKAVI,ENVIFV 

. : : :L. . . : - £: LS .DDI.. . : : L. . 

H: 1 £PgtvdyIL9©YdRvetEegRLsDilsGy.iOPddgiapdeAPtathie3elaeepssekd 

. — '--HQDESN WPlqDPSTLKKKFDDI f s ATRYTKALEVIK- - 

. .D:S. P DP- . . : : :F. : : . . : . . K, : . : K 

daadaddddDEdEeeeessdddsEagdggp. - DPEeArerFgel - . reqlektkkalkKh 

KLHKDOMQEI KTFRLKL ENL-QTVKDQAHKLREN 

K . . . :L : L : -V: ; :R: . 

GRgskqalealeaLAeiEmpikLvPKQfDaLVervRgmldrvRkqERalMklCVrdArMP 

XAQ DQEKS DAS KS — QMEQLKEKICGTERE I LQME 260 

I . . . KS ... : . : E : : KE .1* E 

RkdFikisFpgnETnleWlekllkskkkyadeaLervkedllrcQqKLadlE 227 



Legend of Alignment 

: positive score 

3Core between -2 and 0 

Please report problems and feedback concerning bioSCOUT through the support jnterface. 
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: ;,,Help 






Features-." - : 



Alignment: 1 1 16E,rad50.sid2 - pfam|hmmiDUF259 



HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|DUF259 

Protein of unknown function, DUF259 - 

♦ This hit is scoring at : -1 37. 1 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 



Q: 441 DGQIQSKIESMSGILRRRKDKEKERDAASVEXiSK * FNLSRID 

:.G : : . R, , '- : V: . . . F :::D 

H: 1 adamRamLDqLMGRdasnRngdesrqkkkvkfdDpeVCrsyLvgfCPHDlFinTKinDnLG 

_ ERERHMQIEVERKTLALG fiftDYDSIISQ— - KRTEVYSLEQKIKVLLR 

: : . . . . E . . K: - ERD :..£:., L KIK 

pCpkvHdlklradYErasksrdyfpkfErdalef lersvsEvtqsPEiLELsEki kEkmk 



EKDIinrnADERVKLGLKKDALESSKDKLNEIVNEHK— DKIKKVLRGR 

E, : D : : . . K . . LE . ; : : : . .* . E . K : , L 

EAfv. . . .hDcdrridkakqrLeetgeeqtkeaaaskRqaeelaeldeekAstPqPvPAq 

NPFEKDM KKEINQAFWPVdKEYNELRSKSQEAEQ-— E 



: : ; ; A : - - EL : : K . : E . — : E 
PPssELPPPDPRtqEvIgkllaeaEaLGeeGkVdeaqklifi. kevEeLkakkkelesklde 

LKFTQSKVTDAreqltklrrdmdakrrtldsklqsilqisanvitofpkVLQDAMNECR 

A D-M. : : 

vrnaapssaqa * WslDeMgqqKLr 



. DEQKRLENFANG MREMLAPFEHLARK NRVCPCCER 

D..:RL.:. . G : RE . L A . . : K ... ER 

VCEvCGAyLsvlDadrRlADHfgGKlHLGYvkiReklaeLkkakaklkeevkktgrekeR 

AFTPDEEDEFVKKQRMQ 717 
. . . E. : : : - QR. . 
eereretektldgqRqh 335 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface. 



httn://eshnc02.es.duDontxom:810U 5/21/2003 

PAGE 54/64 ' RCVDAT 1112612003 2:24:56 PM [Eastern Standard Time] * SVR:USPT0-EFXRF-1/4 * DNIS:8729307 ' CSID:515 334 6883 * DURATION (mm-ss):23-16 



11/26/03 WED 14:44 FAX 515 334 6883 PIONEER HI-BRED DSM 

bioSCOUT 1 -5.3 - Alignment: 1 1 16E.rad50.sid2 - pfamlhmm|filament 



31055 

Page 1 of 1 




bioSCOUT 0 
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Alignment: 1 1 16E.rad50.sid2 - pfam|hmm| filament 



HMMPFAM - alignment of 1116E.rad50.sid2 against pfam|hmm|filament 

Intermediate filament protein - 

# This hit is scoring at : -203.9 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: EEIQ1ELN FLQRTRDTLIVEVDDLRDQH — R-MLNEDMSSAQ 

£::Q . LN FL L V: ::: LR. : . . . . : 

H: 1 nEKcqmQ.nLNDRLAsYIdKVRfLEqqNkeLevIcieelrqkqsrggpasvsrlyslYete 

VRwhNAREcKVKASSILERFQKSEEELVLLAEE-KEQLlV EKKL — L 

* . R. : . R. Q . : . L . . E: E . I* 

ie . , eLRrqidqltnerarlqlEicliUrealedf rkKyedKeDLaaQnqlkdlEialnt k 

EESLDPLSkEKESLLQEYNAL KQKLDEEY HQLAE 

E . . L . . E : : - . . : . . L . : : . LDE . - : LA 

eaeLaTaL . eRqeaEndl vgLRaQiAklEslaaftkdlDeaTLarvDLEnkvEsLqEElaF 

RKREFQQELDAL GR — LNMKIKGYL DSKKNEKLKELQgrhvlchSQLQsCM 

K:..::E:..L :N: : : - - . - -:-L:E:: :Q.: : 

LKknHeEE vk^Lqaqiqdt gqvnVEmDaar qqEwklDLt kaLr EiR aQYE . e i 

AKQQRISAELNKSKELLQGQGQLKRNIDDnlkYRKTKADVEQLTRDIESLEERLLSIgsL 
A::.R • AE . . : L ; Q . ,.RN : .R. .K. : : . :L-R. TtSLE .L S: 
AeknrqeaEewYksKl«eLqLaaarngea. . . IrsaKeEiUElRRqiQSLeiELcrsi . , JC 

SAIEADLKRHSQEKERLNSEFNRWQGTLSVYQSNXSKHKQELK—LSQYKDI • 

S. :: :: :ER. .:£. ::Q-.:S ::E:. L. : Y: : : 

sqnasLErqlaElEeryeaelaqyqalisqlEeeLqqlreEMarqLrEYQeLIidVKlaLD 

EKR 1118 

E.R 

iE I AT YRKLLEGEEs R 359 



Legend of Alignment 

: positive score 

score between -2 and 0 



S-ii'Featurfeis" 



Please report problems and feedback concerning bioSCOUT through the support interface. 
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Alignment: 11 16E.rad50.sid2 - pfam|hmm|Myosin_tail 



HMMFFAM - alignment of Ul6E.rad50 T sid2 against pfam|hmm|Myosin_tail 



Myosin tail - 

o This hit is scoring at : -555.1 

o Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 237 ASKSQMEQLKEKICGTEREILQMETSLDE LRRL&GQI — — DIKATERS 

:.Q..:L:E:: . E. £ : Q:...L:: ::.L:.:I : -.ER: 

H: 1 dlerqkrQlaeqlkr]ceselsqlsll<.lEdEqalvaqLqI<k:ikQleaRIeELeEeLEaERa 

T - — LLTQQHEKLAALSEENEDT DEELM EWQTKFEE 

. - . .:L..LSE. : .EL. E . . : . EE 

ARaKaEkqRaDLsrELEeLsERLeEagGaTaaQiElnkKREaELaKLRrdLEEanlqhEe 

RIALLETKISKLVRDMDDE ASYSSVLSKQNSELTHEIGKLQAEADAHLTMKHERDS 

-:A.L..K ...:.::.D: : ..K:.S:L. E:..LA:.D: ...K 

alatLRKKHqdainElsdQieqLqKQKakaEKeKsqlqaEvddllaqlcisitKaKlnaEK 

DIKNICTKHNLGFVPehpfTNDVAMNLT — NRIKARLSSI.ENDLLDKKK5NEDQLDVLWK 
..K-: :: : -V. .:::... L . K:RL.S . : DL . : E Q: . L K 

kakqlEsQlsElqvK. . . . IdElqRqlnDltsqKsRLqsENsdLtrqleEaEaqvsqlsk 

HYLKINA RYSEVDGQIQSKIESMSGILRRRKDK EKERDA AEVELSK 

R E L. . D, E:E.:A .E :LSK 

IfCsqlesQLEeAkRslEEEsReRanLqaqlrnlehDlDslrEqlEEEsEAKaGl^RqLsK 

FN1 S RI DERERHMQI EVERKTLALGE — RDYDS 1 1 SQKRTE VYS LEQKIKVLLREKD UN 

N : E . -L E IS: :...K.. L 

an . aeiqqwrsKf EsECfalraEElEElKkKlnqkisEXEeaaEaanaKcssLEKtKsRLq 

RNAD ERVKLGLKKDALESSKDKLNEIVNEHKDKIKKVLRGRNPFEKDMKKEINQAF 

E : LE. :I: E K.K: . : : . ,:,F 

sElEDlqievEraitaaaseLEKKqknFDKilaEwKkkvdelqaElQtAqreaRnlstElf 

WpVDKEYNELRSKSQEAEQELKFTQSKVTDAREQLTKLRR DMDAKRRFLDSKLQS1L 

: • .EL: • : :...:E K ,Q.::,D..:QL : R : ; : . ,RR L: : : 
r > lKneleElkDqvEaLrRENKnLqdEi kDLtdqLgEgGRnvHELEKarRrLEaEkdELq 

QisANVDmfpkvLQDAMnKRDEQKRXiENFANgMREMLAPFE-HLARKnhvcpcceraf tp 

. A : .: A: : : :E.K L. :-.E .LA.K 

a. . ALeE AEaAL-eqeEsKvlRaqvE-lsqiRsEiERRL^EK 

DEEDEFVKKQRMQnsstaerskalAMESSNAEaLFQQLDKLRTIYDAYVKLVEETIP— - 
: EE E . :K...: A;ES .A. L : - K :. K ;E .1 

EEEf EntRKnhqr aiesLqas . LaEaEaKgKaEalRlKKKLEgdlnELE 

LAEKNLNGHLADES QECAQ AFD DLL -GVLAHVQMDRDAVE ALLQ PTDT I DRHVH E I QQ 1 vk 
:A . :.N: A: : : . .V: A E . ..:R.. . :Q. 
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iaLDhaNkanaeaqKnvKkyqqqvkeLQtqvEeeQRaredareqlavaERRataLqa . . . 

EVEDLEYALDSSGRGVKSLEeiqLELNFLQRTRDTLIVEVDDLRDQHRMLNEDMSSAQVR 

E:E'-L..AL:.: R. K..E -EL ;.L._: ..L Q.R.L. ::::.Q 

ElEELrvaLeqaeRaRKqAE. . . tElaEaservneLtaqnssLiaqKRKLEgelaalqsD 

WHNAREEKVKAssiLERFQKSEEELVLLAEElceqLlVEKKLLeESLDPLSKEKESLLQEy 

..A . E * A £R : K: : , ; , - LAEE L E; t : -L: L.K: ES ::E 
LDEavnElkaA. . -eERakkaqaDaarLaeE- . . LrqEQehs . qklErlRKqLEsqvKe . 

na LKQKL DE -E YHQLAE RKRE FQQEL DALGRLNMKI KGYLdS KWEKLKE 

L: : LDE E - L . R RE.:. ELD, R :.:.: L .K...::K£ 

. . LqvRLdEaEaaAlkgGKkvIqKLEaRVReLEaELdgEqRRhaetqKn.1 -RKaeRrvKE 

LQG RHVLchsQLQSCMAKOQrisaelNKSKellqqqgQ^KRWIDDNLKYRECTK- 

LQ : ; ; : LQ . . : K Q K K . . KR . : : : 

LqfQvEEDkKnle. . . rlQDLvDKLq aKiK tyKRQlEEaEEiaqi.nl 

ADVEQLTKD1ESLEERL-LSTGSLSAIE— AOLKRH 107^ 

.R: :E- . EER. : .SL:.;- A. :R. 
sKyRkaQreLEdAEERADqAEsslnklRqreaKsRrs 8 64 



Legend of Alignmeimt 
: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface. 
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121058 



IHlinnimeirPfFam 



Refine: 



Query: 1116SID2 from: 1 to: 1316 WPDEF Case 1116 RadSO SEQ ID NO: 
2 

Scores for sequence family classification (score includes all 
domains) : 

Model Description Score E- 

value N 



Ra d5 Q_znJh.ook 
07 2 
HSP9_HSP12 

4.1 1 
LEA_1 
5.6 1 
Histone^HNS 

7.9 1 
ABC_tran 

0 .14 1 




RadSO zinc hook motif 

Heat shock protein 9/12 

Late ernbryogenesis abundant (LEA) group 

H-NS hi stone family 

ABC transporter 

RecF/RecN/SMC N terminal domain 

Transforming acidic coiled- coil -contain 
8 1 

Uncharacterized ACR, COG1579 
Tropomyosin 

SMC family, C-terminal domain 
Sigma-70, non-essential region 
Intermediate filament protein 



36.3 
■11 .4 
-14 .3 
•28 .3 
•46 .1 
•69,5 



-90.1 



■116.7 
-122 .2 
•203.8 



6.9e- 



http://C3hpc03.cs.dupont.com:800D/gcs-bin/rosutt.c^i/l I lesitnjimmcrp^^JSOJ.htm^projc^DrcssI I0702&noiiav=l (J of 2)1 1/25/2003 7:10:07 AM 

FAG! 58164 ' RCVD AT 11/26/2003 2:24:56 PM [Eastern Standard Time] * SVR:USPT0-EFXRM/4 * DNIS:8729307*CSID:515 334 6883 ' DURATION (mm-ss):23-16 



11/26/03 WED 14:45 FAX 515 334 6883 



PIONEER HI-BRED DSM 



@059 



ScqWcb Sequence Analysis 

ERM 

9.5 1 



Ezrln/radixin/moesin family 



Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t 



Rad50 zn hook 2/2 



673 726 



57 [] 



-236.2 

score E -value 
32.5 9.9e-06 



Alignments of top-scoring domains: 

Rad50_zn_hook: domain 2 of 2 , from 673 to 726: score 32.5, E = 9.9e-06 

* - >galesekaelkkaieeleeeesscCPvCgReLgteeekkelikeyks 
+ ++++++ 1 ++ +++ + CP+C+R + t++e+ e++k+++ 
673 NFANGMREMLAPFEHLARKNHV- -CPCCERAF-TPDEEDEFVKKQRM 



1116SID2 



716 



1116SID2 



eldrlpeelk<-* 
+ +++ e k 
717 QNSSTAERSK 726 
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Pfam: Mr* i i_l3NA_bin<l 



Wellcome Trvssi 



Protein families database of alignments and HMMs ^ Sanger 

>>j institute 



■Hosnv ^ttyword Searchf Protein Senrth J ©rtrwcaPTam j DMA search j Taxonomy j ftp 

Mrell_DNA_bind 



fel 



Accession number: PF04152 



Previous Identifiers: Mcrll_DNA_blnd; 

Mreli DNA-binding presumed domain 



The Mrell complex is a multi-subunit nuclease that is composed of Mrell, RadSO and Nbsl/Xrs2, and is involved in checkpoint signalling 
and DNA replication [1]- Mrell has an intrinsic DNA-binding activity that is stimulated by RadSO on its own or in combination with Nbsl 
[21 



INTERFRO description (entry IPR007281) 



The Mrell complex is a mulrj-subunit nuclease that is composed of Mrell, RadSO and Nbsl/Xrs2, and is involved in checkpoint signalling 
and DNA replication fMi=DLlNE:21984524 1. Mrell has an intrinsic DNA-binding activity that is stimulated by RadSO on its own or in 
combination with Nbsl rMEDUNE :2 Q3009l4l . 



Alignment 



® Seed (10) O Full (20) 
format 1 Coloured alignment 



Further alignment options here 

Help relating to Pfam alignments here 



Species Distribution 



i! View alignments & domain organisation by 
species 



Tree depth : t Show gH levels 



Domain organisation 



® Seed (10) O Full (20) 



[ As a Graphic { As a Tree 

Zoom |0 5 1 pixeis/aa. □ Bootstrap tree 
-ytewGr^hic 7 ^ |;-;.^ 



To find out about the NIFAS tree-viewer, click 
hers 



Phylogenetic tree 



® Seed (10) O Full (20) 



Download. tree ■; 



ATV Applet:.; 



The trees were generated using Quicktree 
To find out more about ATV phylogenetic tree-viewer dick 
here 
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Alignment method of seed 


Clustalw 


The Mre11 complex: at the crossroads of 


Source of seed members 


Pfam-B_3909 (release 7.3); 


Average Length 


200.3 


dna repair and checkpoint signalling. 


D' Amours o, Jackson SP; 

Nat Rot Mol Cell Biol 2002; 3:317-327. 


Average Void 


39 


Average Coverage | 28.59% 





HMMER build information 




Pfam Is [Download HMMl 


Pfam fs TDownload HMMl 


Gathering cutoff 


25.0 2S.0; 


25.0 25-0 


Trusted cutoff 


71.9 71,9; 


49.5 32.0 


Noise cutoff 


13.1 13.1; 


8.2 17.4 


Build method of HMM 


hmmbuiid -F HMMjs SEED 
hmmcalibrate -seed 0 HMMJs 


hmmbuiid -f -F HMMJs SEED 
hmmcaiibrate -seed 0 MMM_f5 
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